An R package for analyzing and modeling ranking data

Authors

Paul H Lee

Philip LH Yu

Abstract

BACKGROUND In medical informatics, psychology, market research and many other fields, researchers often need to analyze and model ranking data. However, there is no statistical software that provides tools for the comprehensive analysis of ranking data. Here, we present pmr, an R package that bundles tools for analyzing and modeling ranking data. The pmr package provides descriptive statistics (mean rank, pairwise frequencies, and marginal matrix), Analytic Hierarchy Process models (with Saaty's and Koczkodaj's inconsistencies), probability models (Luce model, distance-based model, and rank-ordered logit model), and visualization of ranking data with multidimensional preference analysis.

RESULTS Examples of the use of package pmr are given using a real ranking dataset from medical informatics, in which 566 Hong Kong physicians ranked the top five incentives (1: competitive pressures; 2: increased savings; 3: government regulation; 4: improved efficiency; 5: improved quality care; 6: patient demand; 7: financial incentives) to the computerization of clinical practice. The mean rank showed that item 4 is the most preferred item and item 3 is the least preferred item, and a significant difference was found between physicians' preferences with respect to their monthly income. A multidimensional preference analysis identified two dimensions that explain 42% of the total variance. The first dimension can be interpreted as the overall preference of the seven items (labeled as "internal/external"), and the second can be interpreted as their overall variance (labeled as "push/pull factors"). Various statistical models were fitted, and the best were found to be weighted distance-based models with Spearman's footrule distance.

CONCLUSIONS In this paper, we presented the R package pmr, the first package for analyzing and modeling ranking data. The package provides insight to users through descriptive statistics of ranking data. Users can also visualize ranking data through multidimensional preference analysis. Various probability models for ranking data are also included, allowing users to choose the one most suitable for their specific situation.
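The descriptive statistics named in the abstract (mean rank, pairwise frequencies, marginal matrix) and Spearman's footrule distance can be illustrated with a minimal sketch. This is not the pmr API and not the authors' code: it is plain Python, the function names are hypothetical, the toy data are invented for illustration, and it assumes complete rankings, whereas the physician dataset contains top-five rankings.

```python
# Toy data (hypothetical): each row is one judge's complete ranking of 3 items.
# Entry j is the rank given to item j+1, with 1 = most preferred.
rankings = [
    [1, 2, 3],
    [1, 3, 2],
    [2, 1, 3],
    [1, 2, 3],
]


def mean_rank(rankings):
    """Average rank of each item; a lower value means more preferred."""
    n = len(rankings)
    k = len(rankings[0])
    return [sum(r[j] for r in rankings) / n for j in range(k)]


def pairwise_frequencies(rankings):
    """pair[a][b] = number of judges who rank item a ahead of item b."""
    k = len(rankings[0])
    pair = [[0] * k for _ in range(k)]
    for r in rankings:
        for a in range(k):
            for b in range(k):
                if a != b and r[a] < r[b]:
                    pair[a][b] += 1
    return pair


def marginal_matrix(rankings):
    """mar[item][rank - 1] = number of judges who give that item that rank."""
    k = len(rankings[0])
    mar = [[0] * k for _ in range(k)]
    for r in rankings:
        for item, rank in enumerate(r):
            mar[item][rank - 1] += 1
    return mar


def footrule(r1, r2):
    """Spearman's footrule: sum of absolute rank differences over items."""
    return sum(abs(a - b) for a, b in zip(r1, r2))


if __name__ == "__main__":
    print("mean rank:", mean_rank(rankings))
    print("pairwise frequencies:", pairwise_frequencies(rankings))
    print("marginal matrix:", marginal_matrix(rankings))
    print("footrule distance:", footrule(rankings[0], rankings[2]))
```

In this toy example the first item has the lowest mean rank, which corresponds to the way the abstract reads item 4 as the most preferred incentive. A distance-based model, as described in the abstract, would build on a distance such as the footrule between each observed ranking and a central ranking; that modeling step is not sketched here.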


Similar resources

A modified branch and bound algorithm for a vague flow-shop scheduling problem

Uncertainty plays a significant role in modeling and optimization of real world systems. Among uncertain approaches, fuzziness describes impreciseness while for ambiguity another definition is required. Vagueness is a probabilistic model of uncertainty being helpful to include ambiguity into modeling different processes especially in industrial systems. In this paper, a vague set based on dista...


Rankcluster: An R Package for clustering multivariate partial ranking

Rankcluster is the first R package dedicated to ranking data. This package proposes modelling and clustering tools for ranking data, potentially multivariate and partial. Ranking data are modelled by the Insertion Sorting Rank (isr) model, which is a meaningful model parametrized by a central ranking and a dispersion parameter. A conditional independence assumption allows to take into account m...


seqMeta: an R Package for meta-analyzing region-based tests of rare DNA variants

Region-based tests are becoming a popular tool for analyzing rare genetic variants. In order for these tests to have adequate power, it is often necessary to meta-analyze information from multiple contributing studies, where consent restrictions make it difficult or impossible to share individual level data. We present the R package seqMeta for meta-analyzing region based tests, such as SKAT, S...


Journal title:

Volume 13, issue

Pages -

Publication date: 2013
